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ABSTRACT 

An initial zonally symmetric quasigcostrophic potential vorticity (PV) distribution qi(y) is subjected 
to complete or partial mixing within some finite zone \y\ < L, where y is latitude. The change in 
M, the total absolute angular momentum, between the initial and any later time is considered. 
For standard quasigcostrophic shallow- water beta-channel dynamics it is proved that, for any qi(y) 
such that dqi/dy > throughout \y\ < L, the change in M is always negative. This theorem 
holds even when "mixing" is understood in the most general possible sense. Arbitrary stirring or 
advective rearrangement is included, combined to an arbitrary extent with spatially inhomogencous 
diffusion. The theorem holds whether or not the PV distribution is zonally symmetric at the later 
time. The same theorem governs Boussinesq potential-energy changes due to buoyancy mixing 
in the vertical. For the standard quasigcostrophic beta-channel dynamics to be valid the Rossby 
deformation length Lr> S> eL where e is the Rossby number; when Lr> = oo the theorem applies 
not only to the beta-c hannel, but also to a single barotropic layer on the full sphere, as considered 
in the recent work of iDunkerton and Scottl on "P V staircases" . It follows that the M-conserving 
PV reconfigurations studied by those authors must involve processes describable as PV unmixing, 
or antidiffusion, in the sense of time-reversed diffusion. Ordinary jet self-sharpening and jet-core 
acceleration do not, by contrast, require unmixing, as is shown here by detailed analysis. Mixing 
in the jet flanks suffices. The theorem extends to multiple layers and continuous stratification. A 
least upper bound and greatest lower bound for the change in M is obtained for cases in which qi 
is neither monotonic nor zonally symmetric. A corollary is a new nonlinear stability theorem for 
shear flows. 



1. Introduction 

Ideas about the turbulent mixing of vorticity and po- 
te ntial vo r ticity (PV) , g oing back t o the p ioneering wor k 
of iTavlorl (1l915L Il932h . iDickinsonl (Il969h . iGreenl (Il970h . 
and lWelanderl(ll973l) are an important key to understand- 
ing such phenomena as Rossby- wave "surf zones" , jet self- 
sharpening, and eddy-transpo rt barriers. For a review see 
iDritschel and Mclntvrel (120081. hereafter DM08); a l so, fo r 
exam ple. iKillworth arid Mclntvrel (Il985l). iHughesI (1 19961) 
Held! (|200lUMcintvrel (l2008h . lEslerl (l2008allbh . and lBtihler 



(2003). A key point is that PV mixing generically re- 
quires angular-momentum changes. In the real world those 
changes arc usually mediated by, or catalyzed by, the radi- 
ation stresses or Eliassen-Palm fluxes due to Rossby waves 
and other wave types, including the form stresses exerted 
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across undulating stratification surfaces. Usually, there- 
fore, there is no such thing as turbulence without waves. 

PV mixing by baroclinic and barotropic shear instabil- 
ities depends on radiation stresses internal to the system, 
mediating angular-momentum changes that add to zero. 
Cases like that of Jupiter's stratified weather layer prob- 
ably depend on form stresses exerted from below, as is 
known to be true of the terrestrial stratosphere. 

Consider for instance the quasigeostrophic thought ex- 
periment shown in Fig.[TJt. This is an idealization of Rossby- 
wave surf-zone formation. An initially linear PV profile 
(thin line) is mixed such that the PV becomes uniform 
within a finite latitudinal zone \y\ < L (thick zigzag line). 
The mixing is assumed to be conservative in the sense that 



dxdy Aq = 



(1.1) 



where q is the quasigeostrophic PV and Aq its change due 
to mixing; dxdy is the horizontal area element. It is well 
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FlG. 1. Examples of initial and final zonally symmetric PV 
profiles (thin and thick lines respectively). For each initial pro- 
file the PV increases linearly with latitude y. The examples 
could represent PV distributions in a quasigeostrophic shal- 
low water system, in a nondivergent barotropic system, or in 
a single layer within a multilayered or continuously stratified 
system. The angular-momentum changes AM are respectively 
negative, positive, and zero in cases (a)-(c), all of which sat- 
isfy Hl.l[l. Cases fb) and (c) require unmixing, or anti-diffusion. 
iDunkerton and Scottl ()2008h restrict attention to cases like (c). 



known that, according to standard quasigeostrophic the- 
ory, the resulting change AM in the total absolute angu- 
lar momentum M is negative or retrograde, in this special 
case with the initial profile linear in j/Q Such angular- 
momentum deficits are key to understanding why, for in- 
stance, breaking stratospheric Rossby waves gyroscopically 
pump a Brcwer-Dobson circulation that is always pole- 
ward and never equatorward. The troposphere exerts a 
persistently westward form stress on the stratosphere. The 
physical reality of such surf-zone formation events and their 
tendency to mix PV has been verified in a vast number of 
observational and modeling studie s, including studi es of 
the stratospheric ozone layer (e.g.. lLahoz et al.ll2006l and 
references therein). 

In an interesting rece nt paper in this journal, 
Dunkerton and Scottl (|2008l hereafter DS08), consider a 
class of PV reconfigurations in a single layer on the sphere, 
with zonally symmetric initial and final states, satisfying 



x For an explicit demonstration, see, e.g., DM08 Eqs. (7.1)-(7.2) 
and below Eq. (A. 4), noting that the integration by parts at the 
penultimate step is valid both for bounded and unbounded beta 
channels provided that the chan ge Ail in the z onal-mean zonal flow 
vanishes at the side boundaries (Phillips 1954). For the unbounded 
channel, AM is entirely due to the ageostrophic mass shift associated 
with the northward residual circulation, since Au integrates to zero. 
For the bounded channel there are contributions both from the mass 
shift and from Au. 



and constructed so as to make AM = 0. In DS08 the dy- 
namics is nondivergent barotropic. That is, the Rossby 
deformation length Lo = oo, and q is the absolute vortic- 
ity As illustrated in DS08, the constraint does not by 
itself dictate the sign of AM. However, in view of the ubiq- 
uity of radiation stresses in real atmospheres and oceans, 
one is led to question whether the assumption AM = is 
a natural one for realistic models. 

Figure [TJd shows a simple case where AM is positive 
and Fig. [Ik a case where AM is zero as in DM08. Both 
these cases must involve unmixing, or anti-diffusion. To go 
from the initial to the final state in Fig. [TJd or Fig.[JJ;, one 
must transport q nonadvectively against its local gradient, 
at least in some locations (x,y). Such locally countergra- 
dicnt transport seems unnatural, at least as a persistent 
phenomenon in a model free of gravity wave stresses. 

To exclude such countcrgradicnt transport we will re- 
strict the PV reconfigurations, throughout this paper, not 
only to respect (|l.ip but also to be describable as "gener- 
alized partial mixing" , or "generalized mixing" for brevity. 
This will be made precise in section [21 using the stan- 
dard "mixing kernel" or "redistribution function" formal- 
ism, but in essence means that no unmixing is allowed. 
With that restriction, and a nonvanishing change Aq in 
the PV profile, we will prove a theorem stating that AM 
will always be negative, as it is in the special case of Fig.QJi, 
provided only that the initial PV profile is zonally symmet- 
ric and monotonically increasing in y. In all other respects 
the initial profile is arbitrary. 

This theorem — which we designate as "basic" since it 
underpins the rest of our analysis — has been proved in 
several different ways. In section [5] we give what we think 
is the most readable of these proofs, after relating AM 
to Aq in sections [3] and [4] Section [6] points out that the 
basic theorem has an alternative interpretation in terms of 
potential energy and available potential energy. 

Central to the proof in section [5] is an intrinsically non- 
negative "bulk displacement function" constructed from 
the redistribution function. Its physical meaning is briefly 
discussed in section [7] Appendix A presents one of the 
alternative proofs, based on a second, quite different non- 
negative function. That function is related to the so-called 
momcntum-Casimir invariants of Hamiltonian theory and 
therefore math ematically relate d , also, to energy-Casimir 
invariants (e.g.. Ishepherdlll993l ) again connecting with the 
theory of available potential energy. This second nonncg- 
ative function is constructed from the initial PV profile 
rather than from the redistribution function. 

The upshot is that from section [S] and Appendix A 
we have two entirely different proofs not only of the sign- 
definitcness of AM, but also of the sign-definitcness of the 
potential energy change due to generalized vertical mix- 
ing of an initially stable stratification. T his generalizes 
classical results both on vortex dynamics ( Arnol'd 19651) 
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and o n available potential energy (jHollidav and Mclntvre 



19811) , beyond the Hamiltonian framework. The potential- 



energy interpretation applies to a Boussinesq model with a 
linear equation of state. The proofs in sections [5] and Ap- 
pendix A provide us with two completely different types of 
sign-definite integral formulae, typified by (|5.6[) and (|A3I) 
below, for AM and for the analogous sign-definite change 
in potential energy. 

As summarized in section [3] the basic theorem covers 
three classes of model system: first, a shallow-water beta 
channel, second, a stratified quasigeostrophic beta channel, 
and third, the system considered in DS08 — a sphere with 
Ld =00. Section |8j also points out that the basic theorem 
provides, as a corollary, a substantial generalization of the 
Charney-Stern shea r-flow stability theorem, related also to 
the classical work of lArnol'dl ( 1965t ). 

Section [5] presents a generalization of the basic theorem 
to cases in which the initial PV profile is neither monotonic 
nor zonally symmetric. 

Sections QJJ] and [TTJ discuss how the basic theorem ap- 
plies to jet self-sharpening by PV mixing in the jet flanks. 
In section [TU] we show via a specific example how a process 
for which AM must always be negative can nevertheless 
result in jet-core acceleration. Section [TT] goes on to prove 
a much more general result. For the shallow-water model, 
PV mixing anywhere on one or both the flanks of a jet 
must always accelerate the jet core, provided that the jet 
is zonally symmetric both before and after mixing. 

Section [12] briefly discusses the possibility of extend- 
ing these results beyond quasigeostrophic to more accurate 
models. So far, we have failed to find such extensions. Ob- 
stacles to progress include the nonlinearity of accurate PV 
inversion operators. In the concluding remarks, section [T31 
we touch on the implications for models of geophysical tur- 
bulence. In particular, our results underline the need to 
pay closer attention to the angular-momentum budget in 
such models. 

2. Definition of generalized mixing 

As well as ordinary diffusion-assisted mixing we want 
to include the limiting case of purely advective rearrange- 
ment, or pure stirring. All such cases, from pure stir- 
ring to partial mixing to perfect mixing, can be described 
as linear operations on the PV field. They are conve- 
niently represented in terms of a Gre en's function or in- 
tegra l kernel in the s tandard way (e.g., Pasquill and Smith 
1983] iFiedlerl Il984 IStullI 1 19841 : IPlumb and McConalogue 



19881 IShnirelmanlll993t iThuburn and Mclntvrdll997l: lEsler 
2008al ). Such Green's functions have properties akin to 
probability density functions, and are called bistochastic 
or doubly stochastic. The corresponding linear operators 
are sometimes called polymorphisms. 

The Green's function formalism is essentially the same 



for all the model systems under consideration, including 
those describing potential-energy changes. So it will suffice 
to restrict attention at first to the shallow- water case. For 
a general two-dimensional domain T>, let qi(x,y) be the 
initial PV distribution and qi(x,y) the PV distribution at 
some later time. Because of linearity and horizontal non- 
divergence we may write 



1i{x,y) = JJ dxdy q,(x ,y )r(x,y ;x,y) (2.1) 
where the kernel r satisfies the following three conditions: 



dxdyr(x', y';x,y) = 1 for all (x', y') G T> , (2.2) 



dx'dy' r(x, y'; x,y) = l for all (x, y) € V , (2.3) 
v 

and r{x',y';x,y) ^ for all (x, y), (x', y') e V , (2.4) 

but is otherwise arbitrary. Here we call r(x',y' ;x,y) the 
"redistribution function" defining the generalized mixing 
that takes place between the initial time and the later time. 
The condition (|2.2[) ensures that r(x\y' ; x,y) represents a 
conservative redistribution of PV substance in the sense 
that (|1.1|) is satisfied. To show this, integrate (|2.1|) with 
respect to x and y and then use (|2.2p to deduce (|1.1[) with 
Aq = qi—qi- The conditions (|2 . 3[) and (|2.4p ensure that, for 
given {x,y), qt{x, y) is a weighted average (with positive or 
zero weights) of the initial PV values qi (x', y'). This in turn 
ensures that generalized mixing cannot increase the range 
of PV values, and in particular that an initially uniform 
PV profile remains uniform. 

We may think of r(x',y' '; x,y) dx dy dx' dy' as the pro- 
portion of fluid transferred from area dx'dy' at location 
(x',y') to area dxdy at location (x,y). Here "fluid" has 
to be understood in a particular way. The notional fluid, 
or material, has to be the sole transporter of g-substance, 
whether by advection or by diffusion or otherwise. That is, 
we imagine that different amounts of q-substance are at- 
tached permanently to each fluid particle, so that, in par- 
ticular, the diffusivity of q is the same as the sclf-diffusivity 
of the notional fluid. The notional fluid is incompress- 
ible, as required by (|1.1[) . (|2.2[) . and the concept of self- 
diffusivity. 

The mathematical properties of the Green's fun ction 
operators are further discussed in IShnirelmanl (|l993l ). For 
instance, they form a partially-ordered semigroup. The 
partial ordering corresponds to successive mixing events. 

For the PV-mixing problem we are mainly interested 
in a zonally symmetric domain \y'\ < L; and sections [5]- 
[5] will consider only zonally symmetric initial PV profiles, 
qi(y'). The PV distribution after generalized mixing may 
or may not be zonally symmetric. However, the angular- 
momentum change AM depends only on qi{y') and on the 
zonal or x average of qi(x,y), denoted q~t(y). It is conve- 
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nicnt to define 



R(y', y) ■= I dx' r(x', y'; x, y) 



(2.5) 



the overbar again denoting the average with respect to x 
(not x'). The zonal average of (|2.1[) is then 



m(y) = J^dy' qi (y')R(y',y) , (2.6) 
where J dyR(y',y) = l for all y' £ [-L, L] , (2.7) 

J dy'R(y',y) = 1 for all y£ [-L,L] , (2.8) 

and R(y',y) > for al\y,y£ [-L, L] , (2.9) 

(|2~7 ) -([2~9" ]) being the counterparts of P ^M pO ]) . 

A redistribution function R representing pure diffusion 
is symmetric in the sense that R(y' , y) — R(y, y'). This fol- 
lows from the self-adjointness of the operator representing 
the divergence of a downgradicnt diffusive flux. It is some- 
times assumed that all redistribution functions are sym- 
metric, but that would be too restrictive for our purposes. 

Consider the examples of purely advective rearrange- 
ment in Fig. [2] The first two examples, with redistribution 
functions Ri(y' , y) and ^2(2/, y), are symmetric. They cor- 
respond to patterns in the (y , y) plane that are mirror- 
symmetric about the main diagonal, representing simple 
pairwise diffusionless exchanges of fluid elements. The third 
example depicts the effect of R\ followed by i?2, giving 



Qeiv) 



\y' [ L dy"q l (y")R 1 (y",y , )R 2 (y',y) 

L J-L 



(2.10) 



That is, the effect of Ri followed by R2 is described by the 
composite redistribution function 



R 2 °Ri (y", y) 



dy'R 1 {y",y')R 2 {y , ,y) , (2.11) 



which is asymmetric. It represents a cyclic permutation of 
three fluid elements and is the simplest kind of asymmetric 
redistribution function. To be completely general we need 
to include such cases and their elaborations. 

In section[5]and Appendix A we use the fact that purely 
advective rearrangements are reversible, hence described 
by invertible mappings. 

3. M in terms of q for shallow water 

For shallow-water beta-channel dynamics we may de- 
fine M as the total absolute zonal momentum per unit 
zonal (x) distance. Let the shallow-water layer have depth 
H — b(x, y, t) + h(x, y, t), where H is constant, h is the free 
surface elevation, b is the bottom topography, and 



h -C H, b -C H. We assume b = b(y). The fluctuat- 
ing part b(x, y,t) := b — b can provide a quasi-topographic 
form stress to change M and catalyze PV mixing, as may 
happen in Jupiter's stratified weather layer. We choose 
the Coriolis parameter to be a constant, /o, thus regarding 
the beta effect as due to the northward or y gradient of 
the zonally averaged bottom profile b(y), corresponding to 
the latitudinal gradient of Taylor-Proudman layer depth 
in the middle latitudes of a spherical planet. Let po be 
the constant mass density and m(x, y, t) the zonal velocity 
with u{y,t) its zonal average. Then to quasigeostrophic 
accuracy 



M = po J dy (H + h-b)(u- f y) 
= p H J dy (u- foy^-j^- - foy 
= p H I dy (u - f y^-^- I 



—L 



H 



(3.1) 
(3.2) 
(3.3) 



Introducing the quasigeostrophic stream function ip = gh/ fo 
and the Rossby deformation length Ljj = \fg~H / fo we have 



M = p H j dy 



dip , h-b 
dy H 



const. 



(3.4) 



p H J dy - L D 2 ip + f3yj y + const. , (3.5) 

where the first term has been integrated by parts. We have 
defined 



to - ^ 



(3.6) 



and assumed that the Phillips boundary condition holds, 
namely 

du _ d 2 ip 



dydt 

implying that the boundary term 

+L 



on y = ±L 



(3.7) 



PoH 



dib 
dy 



const. 



(3.8) 



The Phillips boundary condition is the standard way of 
stopping mass and a ngular momen tum from leaking across 
the side boundaries (|PhilliDslll95l . Denoting the variable 
part of M in (|3.5|) by M and defining q in the standard 
way, ignoring a contribution fo, as 



we have 



q := V 2 ?A - L D 2 ^ + f3y , 



M = p Q H j dyq(y)y 



(3.9) 
(3.10) 
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FlG. 2. Three redistribution functions of which the first two are symmetric and the third asymmetric. They represent purely 
advective rearrangements within the zone \y\ < L, with no diffusive smearing. Values are zero except on the black sloping lines, 
which represent Dirac delta functions, e.g. 8(y — y ) on the main diagonal y — y . The first two redistribution functions R\ and 
R2 describe simple exchanges of small but finite (strip-like) fluid elements. The composite rearrangement described by the third 
redistribution function R20R1 , see f|2. llf> . is a cyclic permutation among three fluid elements, a "three-cycle" in group-theoretic 
terminology. Notice that the off-diagonal delta functions line up with the gaps, or zeros, in the main diagonal. They line up both 
in the y direction and in the y direction so that both (|2.7|l and (|2.8p are satisfied. 



This expression has an alternative interpretation as the 
Kelvin impulse for the quasigeos trophic system, per unit 
zonal distance (e.g.. lBuhlerll2009| ). Initially 



M = Mi :=p Q Hl dy q t {y)y 



(3.11) 



At the later time after generalized mixing, the averaged q 
becomes qi = qi + Aq, so that 

M = Mi := pvH J dy q £ (y) y , (3.12) 

pL pL 

= Po H dy / dy' q i (y')R(y' > y)y , (3.13) 

J-L J-L 

= Mi + AM (3.14) 

say, with 

AM = Po Hj L dyAq(y)y, (3.15) 

pL pL 

= P0 H dy / dy' qi (y')AR(y',y)y , (3.16) 

J-L J-L 

where 

AR(y',y) := R(y' ,y) - 6{y' - y) , (3.17) 

the difference between the redistribution function R(y' \y) 
and the do-nothing redistribution function 5{y' — y). Here 
8 denotes the Dirac delta function. 

4. M in terms of q for other systems 

The relations in section [3] extend straightforwardly to 
the sphere and to a stratified quasigeostrophic system in a 
beta-channel. 

In the stratified system, with say a bottom boundary at 
pressure altitude z = zq, the PV is redistributed separately 
on each z surface, and the buoyancy acceleration f^dip/dz 
is redistributed on z = zq. Therefore, each altitude z has 
its own R and AR functions, R(y',y;z) and AR(y',y;z) 



say. To obtain a concise formulation we may de fine the PV 
to inc lude a delta function at z = zq following iBretherton 
(|l966h . 



Q(x,y;z) : = V 2 </> + 



1 d 



p (z) dz 



Po(z) 



Jo 2 di> 



N(z) 2 dz 



/o 2 

N(z ) 2 dz 



8(z - zq) + Py , 



.1) 



where po(z) is the background density, N(z) is the back- 
ground buoyancy frequency, and V 2 still denotes the hor- 
izontal Laplacian. If there is a rigid top boundary, then a 
further delta function can be added. The initial and later 
M values and the difference between them are now, respec- 
tively, 



Mi = 



Jdzp (z) J dy Qi(y;z)y , (4.2) 

p pL pL 

M e = dzp (z) dy dy' Q l (y';z)R(y',y;z)y , (4.3) 

J J-L J-L 



and 

AM = ldzp (z] 



L ,-L 

dy I dy' Q i {y';z)AR(y' 1 y;z)y 



-L 



-L 



(4.4) 



The contributions to M add up layerwise because PV in- 
version is a linear operation in quasigeostrophic dynamics. 

These relations also extend to a single layer on a sphere, 
provided that = 00 and that absolute zonal momentum 
per unit zonal distance is replaced by absolute angular mo- 
mentum per radian of longitude. Then the counterpart of 
(|3~T5)) is 



AM = poHa^J dp Aq(p) p 



(4.5) 



where a is the radius of the sphere, p := sin0 where <j> 
is the latitude, and q now denotes the absolute vorticity. 
The generalized-mixing conditions (|2.6[) - (|2.9[) and the for- 
mulae (|3.11|1 — (|3.17|) apply to the sphere provided that y is 
replaced by p, poH by poHa 4 , and ±L by ±1. 
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5. The basic theorem 

In this section we prove the basic theorem that AM is 
always negative for monotonically increasing qi (y ) and any 
nontrivial rearrangement function R such that integrals like 
(|3.16|) make mathematical sense, with values independent 
of the order of integration. The same proof will apply to the 
potential-energy problem, with zonal averaging replaced by 
horizontal area integration for general container shapes, as 
explained in section |B1 

Nontrivial means "do something" rather than "do noth- 
ing" : Ai? in (|3.17|) must be nonvanishing in an appropri- 
ate sense. More precisely, nontrivial means that R(y' \y) 
and AR(y',y) have nonvanishing off-diagonal values some- 
where, where those off-diagonal values have nonzero mea- 
sure in the sense that they can make nonzero contributions 
to integrals like (|3.16p . This in turn means that the nonva- 
nishing off-diagonal values must exist in some finite neigh- 
borhood, albeit possibly a neighborhood in the form of a 
line segment, as in the delta-function examples of Fig. [2] 

Equation (|3.16[) can be rewritten 



AM = p Q H dy' qi {y')n{y') 



where by definition 



v(y') 



dy AR(y',y)y . 



(5.1) 



(5.2) 



In virtue of (|3.17[) . rj(y') may be regarded as the average 
latitudinal displacement of fluid initially at y' . Denote the 
indefinite integral of r](y') by H(y') (cyrillic-style big Eta). 
Specifically, 



y' ry' rL 

dy"v(y") = d y" dyAR(y",y)y (5.3) 

-L J-L J-L 

y' rL 



dy" dyAR(y",y)(y~y') 

-L J-L 

- [ L dy"[ L dyAR(y" 7 y)(y-y') , 

Jy' J-L 



(5.4) 
(5.5) 



where the penultimate step uses (|2.7[) and (|3.17[) , implying 
that J^ L dyAR(y",y) = for all y", and the last step 

and (HH]), implying that J^ L dy" AR(y" \y) = for all y. 
The last step depends on interchangeability of the order of 
integration. From (|5.4p and (|5.5|) we see that H(— L) = 
= H(+L). Therefore (|5.1[) may be integrated by parts to 
give 



(5.6) 



So if, finally, for nontrivial i?, we can prove that H(y') 
is nonnegative for all values of y and nonvanishing with 




nonzero measure for at least some values of y' , then the 
theorem will follow. That is, (|5.6[) will then imply that 



(5.7) 



AM<0 if dqi(y')/dy' >0 for all y' , 



and vice versa. That is, the sign of AM must always be 
opposite to the sign of the initial monotonic PV gradient. 

To prove that H(y') is nonnegative we rewrite (|5.5[) . 
after changing the order of integration, as 

- fdy fdy"AR(y",y)\y-y'\ 

J-L Jy' 

-f L dyf L dy"AR{y",y)\y-y'\. (5.8) 
Jy' Jy' 

Again because J_ L dy" AR(y",y) = for all y, we may re- 
place Jy, dy" by — f^ L dy". Applying this to the second 
term only, we obtain an expression in which Ai? can be 
replaced by the nonnegative function i?, 

H(y') - ["dy [ L dy"R(y",y)\y-y'\ 

J-L Jy' 

+ f dy fdy"R{y",y)\y-y'\ , (5.9) 

Jy' J-L 

because there are no contributions from the main diagonal 
y — y" . For given y' , the two rectangular domains of inte- 
gration for (|5.9[) intersect each other and the main diagonal 
at a single point only, y = y" = y . (The two domains are 
mirror images of each other in the main diagonal.) At the 
point y = y" = y' the factor \y — y'\ is zero, annihilating 
any delta functions. Therefore H(y') is nonnegative. 

Now as y' runs from — L to L, the two domains sweep 
over the upper and lower triangles of the square — L ^ y ^ 
L, -L ^ y" ^ L, together covering the entire square. 
By definition, a nontrivial R function must have nonzero 
measure somewhere off the main diagonal, in some finite 
neighborhood of a location with \y\ ^ L, \y'\ ^ L, and 
y — y' =/= 0. Whichever moving domain encounters that 
location must continue to intersect it as y' runs through 
some finite range of values, implying that M(y') > over 
that finite range. So H(y') is not only nonnegative, but also 
nonvanishing with nonzero measure, for any nontrivial R, 
and the theorem follows. 

An alternative proof using an entirely different nonneg- 
ative function is given in Appendix A. 

6. Connection to available potential energy 

The basic theorem can alternatively be read as govern- 
ing the sign of the potential-energy change due to three- 
dimensional generalized mixing of a Boussincsq fluid within 
a fixed container in a uniform gravitational field. 
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Consider first a container with vertical walls. Then 
(|5.6I) carries over at once if we read qi as the buoyancy 
acceleration, y' as the altitude, the R and AR functions as 
applying to horizontal area averages, starting from a three- 
dimensional version of the r function in ()2. ![) — (|2.4|) . and 
AAI as proportional to minus the potential-energy change. 
Second, consider a container of arbitrary shape V as being 
embedded within the vertical-walled container. We merely 
extend the definition of r and hence of R and AR such that 
no generalized mixing takes place outside V. With this 
understanding (|5.6p still applies, and (|5.7[) follows. That 
is, if the initial state is undisturbed and stably stratified, 
with the same stratification at all horizontal positions (in- 
cluding those in any separate "abyssal basins"), then the 
potential-energy change is guaranteed to be positive for any 
nontrivial R whatever. This generalizes a standard result 
in the theory of available potential energy saying the same 
thing for a purely advective R (e.g.. lHollidav and Mclntvre 
198ll Appendix A below). 

We emphasize that the generalized result depends on 
having a linear equation of state, as is standard for Boussi- 
nesq models, since only then is the buoyancy acceleration 
a transportable, mixable quantity^ 

7. The physical meaning of H(y') 

Reverting to the PV interpretation, with y northward 
rather than upward, we consider the function U(y')/(L + 
y'). The definition ([O]) shows that VL(y')/(L + y') is the 
average northward displacement of all the notional fluid ini- 
tially south of y' . Equivalently, VL(y')/ (L + y') is the north- 
ward displacement of that fluid's centroid. This makes the 
nonncgativeness of VL more intuitively apparent. The cen- 
troid is initially as far south as it can be, and can therefore 
only move northward. We may reasonably call H(y') it- 
self the "area-weighted bulk displacement" of all the fluid 
initially south of y , or "bulk displacement function" for 
brevity. 

The fact that H(L) = expresses what can also, now, 
be seen to be intuitively reasonable, namely that there can 
be no bulk displacement of the entire zone — L ^ y' ^ L. 
The fluid has nowhere to go. Its centroid must remain 
fixed under any generalized mixing operation confined to 
the zone — L ^ y' ^ L. And the symmetry expressed by 
(15.91) says that we may equally well think of H(y') as the 
southward area-weighted bulk displacement of all the fluid 
initially north of y = y . 



For more general equations of state, especially those containing 
thermobaric terms, there is no straightforward concept of potential 
energy. As first shown by W. R. Young, the Boussincsq limit then 
needs reconsideration, and the consequences are nontrivial. It turns 
out that potential energy has to be replaced by a "dynamic enthalpy" 
that contai ns both grav itational and vestigial thermodynamic con- 
tributions (Young 2010). Such generalized Boussinesq models are 
outside our scope here. 




K(Z/') 



FlG. 3. Illustration of H(y') for a redistribution function that 
simply exchanges material between latitudes y\ and 1/2, as for 
instance in Fig. [2b- The finite slopes near j/i and j/2 are due to 
the finite widths of the fluid elements exchanged. The maximum 
value of H is 3/2 — yi ■ 



Figure [3] shows a simple example, the bulk displacement 
function H(y') corresponding to the R function shown in 
Fig-Ep- Nothing happens to the fluid south of y± and north 
of 2/2- However, there is, for instance, a northward bulk 
displacement of the fluid originally in (— L, y') whenever y' 
lies between y\ and 2/2 ■ The transitions across y\ and 7/2 
have small but finite widths, corresponding to the small but 
finite line segments in the off-diagonal regions of Fig. [5b- 

The foregoing applies of course to the potential-energy 
interpretation, with northward and southward replaced by 
upward and downward. 

8. Further implications, including generalized shear- 
instability theorems 



The result (|5.6[) carries over to DS08's case of a sphere 
with L]j = 00, with y replaced by /1, the sine of the latitude, 
and AM replaced by its spherical counterpart, the absolute 
angular momentum increment (|4.5p . as noted at the end 
of section 2J And (|5.6p also carries over to the stratified 
systems of section |H with the factor p n H replaced by a 
vertical integration and q by Q as in (|4.1[) - (|4.4[) . Therefore, 
the basic theorem ()5.7|) holds in any case for which there 
are monotonic profiles of Q on each of the levels subject 
to mixing, provided that all the gradients dQ/dy have the 
same sign including the gradients of the Brethcrton delta 
function or functions. 

It is worth noting the implications of such cases for 
the theory of quasi geostrophic shear instabilit y, in particu- 
lar the theorems of ICharnev and Ster 3 (EH) and lArnol'dl 
(Il965h . These theorems in their original forms apply only 
to nondiffusive Hamiltonian dynamics, and therefore only 
to purely advective rearrangements. The basic theorem 
(|5.7I) generalizes the Charney-Stern theorem and a case of 
Arnold's first stability theorem — which we call "Arnold's 
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zeroth stability theorem", or "the Arnol'd theorem" for 
brevity — to cover finite-amplitude disturbances with arbi- 
trary amounts of PV mixing. The Arnol'd theorem in ques- 
tion is the nonlinear counterpart of the Rayleigh-Kuo the- 
orem, rather than the Fj0rtoft theorem of which Rayleigh- 
Kuo is a special case. 

In instability problems there are no external sources or 
sinks of absolute angular momentum. Growing instabilities 
exchange angular momentum purely internally, through ra- 
diation or diffraction stresses. This is possible, the basic 
theorem tells us, only if there are regions in which the q or 
Q gradients have different signs. Conversely, whenever the 
q or Q gradients are nonzero and all of one sign, instability 
is impossible. These are exactly the circumstances in which 
the Charney-Stern theorem and the Arnol'd theorem were 
originally proved for purely advectivc rearrangements, and 
can now be proved, using (|5.7|) . for the far more general re- 
distributions defined in section[2] which include PV mixing. 

The proof runs as follows. We start with q = qi{y), or 
Q = Qi(y) on each level. An initial finite-amplitude distur- 
bance is set up advectively, by undulating the PV contours. 
To do so requires artificial forcing. This is because of the 
hypothesis that the q or Q gradients are nonzero and all of 
one sign. By (|5.7|) . M must change by some nonvanishing 
amount AM during the setup. 

We then let the system run freely. The free dynami- 
cal evolution may include wave breaking and PV mixing 
- going beyond Hamiltonian evolution. PV invcrtibility 
implies that the free evolution can be fully described by 
specifying a succession of PV distributions. Equivalently, 
therefore, the free evolution can be described by a succes- 
sion of R functions operating on q = Qi(y) or Q = Qi(y). 
Each such function is the composite of two R functions, the 
purely advective R function describing the initial setup and 
one of the general R functions describing the subsequent 
free evolution. 

The free evolution keeps AM constant. Since (|5.6|) or 
its Q counterpart, vertically integrated as necessary, is sign- 
definite by hypothesis, then cither it or its negative qualifies 
as a Lyapunov function (from R functions to nonncgativc 
real numbers), whose constancy under free evolution im- 
plies neutral nonlinear stability. This is the generalized 
Arnol'd's zeroth theorem. 

We may remark that the sign-definite function (| A3|) 
below also qualifies as a Lyapunov function, vertically in- 
tegrated as necessary, providing an alternative proof. 

9. Nonmonotonic, zonally asymmetric qi 

The basic theorem (|5.7[) applies to zonally symmetric 
and monotonic qi{y') only. This is the most important 
case, but it may be of interest to note what can be proved 
for more general initial conditions qi(x , y ). 

Consider a pair of PV distributions qi(x',y'), q 2 (x',y') 



that can be derived from each other by purely advective, 
and therefore reversible, rearrangement. That is, 

3a (x, y) = J J dx'dy' q 1 (x',y')s(x',y',x,y) (9.1) 

and rr 

Qi(x,y) = jj^dx'dy' q 2 (x',y')s(x,y,x',y') , (9.2) 

where the redistribution function s describes an invertiblc 
mapping. 

For given s, consider the set of all possible redistribu- 
tion functions r together with the set of all possible com- 
posites ros. Because of reversibility, the set of all r must 
be the same as the set of all r o s. Therefore the set of all 
possible Mi values that can result from applying the r's to 
an initial PV distribution q\ must be the same as the set 
of all possible Mg values from applying the r's to an initial 

For a general initial q±(x ,y) we can always find an 
advective rearrangement s such that q2 is a monotonically 
increasing function of y alone (Appendix B). Denote that 
function by q 2 [y; qi(-)]- The corresponding M value is 

M 2 [(?!(■)] = f dj/to [y';qi(-)]y' . (9.3) 

The basic theorem of section[5]restricts the possible Mi val- 
ues that can be attained starting from q 2 [y\ q\{-)\- Specifi- 
cally, 

M e M 2 (9.4) 

The same argument applies to the monotonically decreas- 
ing case. Because the y origin is in the center of the y 
domain, the resulting q 2 function is simply q 2 [— y; 
and the corresponding M value is — M 2 [qi(-j\. In sum- 
mary, identifying qi(x',y') with our general initial condi- 
tion qi(x ,y), we now have 

-M 2 [ qi (.)] < M e < M 2 [<?;(•)] . (9.5) 

That is, the two possible extreme values of Me correspond 
to the two extreme, monotonically decreasing or increas- 
ing, zonally symmetric profiles into which qi(x',y') can be 
advectively rearranged. 

10. The simplest jet-resharpening problem 

Consider the following shallow-water thought experi- 
ment in an unbounded domain, L = oo. We begin with 
a perfectly sharp jet, with concentrated PV gradients at 
its core (solid curves in Fig.0]). First, the concentrated PV 
gradients are smeared out, decelerating the jet and decreas- 
ing the absolute angular momentum M (dotted curves in 
Fig.@|. Second, the PV is mixed on both sides of the jet, 
resharpening and accelerating it (dashed curves in Fig.@|. 
Perhaps countcrintuitively, the basic theorem (15. 7[) implies 



8 



10 



y 



L 



D 



-10 




-1.0-0.50.0 0.5 1.0 0.00.20.40.60.81.0 

q/q s u/q s L D 

FlG. 4. The jet-resharpening thought experiment. An ini- 
tial PV profile in the form of a step function (solid curves) 
is smeared diffusively (dotted curves). This smeared profile is 
then resharpened by mixing PV on the flanks of the jet (dashed 
curves). Here, as throughout this paper, "mixing" entails con- 
servation of PV substance (|l.ip . 

that M must decrease further, at this second stage, even 
though the jet core accelerates. Let us look at what hap- 
pens in more detail. 

Consider the quasigcostrophic shallow- water system with 
the initial PV profile in the form of a step of size 2q s , 



(y' > o) 
(y' < o) 



Inversion gives the familiar velocity profile 



Ui(y') 



q s L D exp [ - — 



(10.1) 



(10.2) 



shown by the solid curve in Fig.HjD. After the first stage, 
in which the concentrated gradients are smeared out, the 
PV profile is taken in error-function form 



9i(y) 



2q s 
Ly^ Jo 



dy exp 



L 2 



(10.3) 



shown by the dotted curve in Fig.^i, for which the length- 
scale L has been taken as 2Lrj. Inversion gives the cor- 
responding smeared velocity profile, shown by the dotted 
curve in Fig- H]o, as 



ui{y) 



Lpq s 



dy exp - 



\y-y\ 



L 



D 



[cf. (Ti~l~2j) ff.]. The change in M due to the PV redistribu- 
tion in this first stage is 



AMi 



dy(qi - qi)y 



q s L 2 



(10.5) 



as can be verified from an integration by parts. 

After the second stage, the PV has been perfectly mixed 
on either side of the jet core (dashed curves) out to fringes 
at around \y\ = AL_d, say, where A ^> 1. In the figure, 
we have taken A = 10. Within the two perfectly mixed 
regions, the resharpened PV distribution is 



Q2 (y') = 



q s - 5q s {y' > 0) 
(q s - Sq s ) (y' < 0) 



where 



5q s = (XL D ) 1 (q s L/^) < q s 



(10.6) 
(10.7) 



This assumes fringes antisymmetric about y = ±\Lp, as 
well as total PV conservation, Eq. (|1.1[) , and neglect of the 
Gaussian tails in (| 10 . 3[) for y ^> Ljj. The corresponding 
resharpened velocity profile is 



ui{y') = (q s - 6q s ) Ldcxp(- \y'\ /L D ) , 



(10.8) 



provided that the peripheral fringes have length scales ^ 
Ld- (Narrower peripheral fringes, not 3> Id, would invert 
to give two extra jets, albeit weak ones.) 

The change in M due to the PV redistribution in the 
second stage is 



/XLd 
dy {q2~q l )y 



(10.9) 



Because A > 1, |AM 2 | > |AMi|. In this example, M not 
only decreases at each stage, as the basic theorem says it 
must, but the decrease is far greater at the second stage, 
even though the jet core still accelerates. The total change 
AM over both stages is 



AM = AMi + AM 2 = -q s L 2 X 



This can also be written 



AM 



q s L 2 ( Sq s 



(10.10) 



(10.11) 
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As Sq s /q s decreases, the PV profile q 2 returns closer and 
closer to the initial PV profile qi while AM becomes in- 
creasingly large and negative. There is an increasingly 
large cost associated with mixing far from y = 0. In the 
case of Fig. SI Sq s /q s m 0.11. Furthermore AM\ = —2q s L D 
and AM 2 « ~9q s L 2 D . Ii5q a /q s were decreased to 0.01, then 
AM2 would become w — \2bq s L 2 D . 

11. General jet sharpening 

Consider a more general shallow-water thought exper- 
iment, now starting from a general monotonic PV profile 
qi{y')- For definiteness, we take the monotonically increas- 
ing case dqi(y')/dy' > 0. 

We suppose that generalized mixing takes place except 
that there is no mixing across a particular material contour 
initially at latitude y = yo- That is, the contour behaves 
as an eddy-transport barrier. The contour may undulate 
during the mixing, but we assume that it straightens out af- 
terwards and returns to latitude yo , consistent with quasi- 
gcostrophic, area-preserving advection. The net effect of 
the mixing can then be described by a nontrivial zonally- 
averaged redistribution function R{y' , y) = R{y' ', y; yo), re- 
call (1231). such that 



We have 



R(y', y; vo) = o if y < yo < y' or y' <y < y. (11.1) 

It will be proved that, in this thought experiment, for finite 
Ld, the net change Au(y ) in the zonal-mean zonal flow 
at y = yo is always positive. 

In particular, we may choose the material contour y = 
yo to be in the core of a jet. So mixing on one or both 
flanks of a jet must always accelerate the straightened- out 
jet core, regardless of the details of the mixing provided 
only that the jet core has persisted, throughout, as an eddy- 
transport barrier. Mixing could be confined, for instance, 
to locations arbitrarily far from the jet core, though of 
course the resulting Au(y ) would then be small. 

Differentiating the expression for q in p.9j) with respect 
to y, and taking the zonal average, we obtain the inversion 
problem for the change Au{y) in u{y) due to an arbitrary 
change Aq(y) in q(y), 



dy 2 



Au(y) 



dAgjy) 
dy 



(11.2) 



This can be solved with the Green's function G(y,yo) de- 
fined by 



d 2 

dy~2~ l d z ) G (y> yo) = -% - yo) 



(11.3) 



with G(y, yo) vanishing on the boundaries y ± L to satisfy 
the Phillips boundary condition (|3.7[) . The proof will apply 
both to finite and to infinite L (though Lp has to be finite) . 



/ dyG(y,yo)^-Aq{y) , (11.4) 



-L 



dy 



as can be verified by subtracting G(y, yo) times (|11.2j) from 
Au(y) times (|11.3|) and integrating with respect to y. Tak- 
ing A q = J_ L dy'qi(y')A R(y', y; y ), with AR defined by 
(|3.17p . we may integrate (111.4[) by parts to give 

Aw(yo) =- J dy' qi(y')fj(y', yo) (11.5) 



where by definition 

v(y',yo) 



-L 



dyAR(y',y;yo)Y(y,y ) (11.6) 



with Y(y, y ) := dG(y, yo)/dy. Now let 



W,yo) ~ / dy"r,(y",yo) . (11.7) 



The reasoning below (|5.2[) - ([5.5p applies word-for-word to 
the functions rj and H, after replacing the right-hand fac- 
tors y and y - y' in ((5T2J) dUS]) by Y(y, y ) and Y(y, y )- 
Y(y , yo) respectively, yo being fixed throughout. It follows 
that VL(-L,y ) = = VL{+L,y ). Integrating (fiTB"]) by 
parts, we therefore get a result analogous to (|5.6[) . 



(11.8) 




We now use the eddy-transport-barrier assumption (jll.ip . 
The assumption says that R(y',y;yo) and AR(y',y;yo) 
have a block diagonal structure in the y'y plane, with non- 
vanishing values confined to two diagonal blocks meeting 
&ty = y' = yo- If y' > yo, then nonvanishing contributions 
to H(y',yo) come from the upper right block only, and if 
y' < yo from the lower left only. Within each block Y(y, yo) 
is a monotonically increasing function of y, as will be shown 
next, implying that sgn{ Y(y, y )-Y(y', y )} = sgn(y - y'). 
It will then follow that H(j/', yo) is given by the right-hand 
side of (HU) with \y - replaced by \Y(y,y )-Y(y' ,y )\, 
proving not only that H ^ but also that Au(yo) > 
when dqi(y')/dy' > 0, in the same way as below (|5.9[) . 

Because the reasoning below (|5.9[) involves a finite neigh- 
borhood in the y'y plane, it is enough to prove mono- 
tonicity in the interior of each block, more specifically that 
dY(y,y )/dy > 0, equivalent^ 8 2 G(y, y )/dy 2 > 0, for 
y =/= yo and y =/= ±L. It is here that we need the finiteness 
of L D . 

Consider the graph of G(y,yo) as a function of y, in 
each block y < yo and yo < y separately. From (|11.3|) the 
second derivative satisfies 

d 2 G(y, y )/dy 2 = L D 2 G(y, y ) for y ^ y . (11.9) 
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For finite Lb the graph is therefore convex toward the y 
axis everywhere except at y = yo and y = ±L. Because the 
graph goes to zero at both boundaries y = ±L, it can have 
only the one extremum at y = yo. The jump condition 
from (IP) . 



dG(y,y ) 



dy 



v=yo+ 



v=ya- 



1 



(11.10) 



ensures that the extremum is a maximum. Therefore G(y, yo) 
must be positive everywhere apart from the boundaries 
y = ±L and therefore, from ()11.9j) . 

3 2 G(y, y )/dy 2 > for all y ^ y , ±L . (11.11) 

This completes the proof. We have established that, for 
both finite and infinite L, the velocity change Au(yo) in 
the straightened-out jet core satisfies 



Au(y ) > if dq t (y')/dy' > for all y' , 



(11.12) 



and vice versa, for any nontrivial R that preserves the eddy- 
transport barrier at the jet core. 

It is not clear whether there is an alternative proof anal- 
ogous to that of Appendix A. The counterpart of the last 
term of (|A4[) no longer makes a vanishing contribution to 
the counterpart of (|A3I) . 

12. Beyond the present models? 

It might be thought that the beta-channel results should 
extend to the full sphere for finite as well as for infinite L d . 
However, such an extension would be far from straightfor- 
ward, if only because the standard quasigeostrophic theory 
relies on Ld being constant. Hence for finite Ld the re- 
sults are valid only to the extent that the beta channel 
is valid, namely, in a zone that is narrow relative to the 
planetary radius a and sufficiently far from the equator. A 
remaining challenge, therefore, is to make progress beyond 
the restrictions of quasigeostrophic theory and nondivcr- 
gent barotropic theory, Ld = oo. 

Could there be an exact counterpart to the basic theo- 
rem (|5.7p ? The question makes sense at least for thought- 
experiments having a zonally symmetric final as well as 
initial state, with both states in exact cyclostrophic bal- 
ance. Then PV invertibility tells us that there is an exact 
counterpart to the question "what is the sign of the abso- 
lute angular-momentum change that results from general- 
ized PV mixing?" Here "exact" indicates not only exact 
cyclostrophic balance but also use of the exact (Rossby- 
Ertcl) PV. 

The conservation a nd impermeability theorems satis- 
fied by the exact PV ( Havnes and Mclntvre 1990() guar- 
antee that the distinction between generalized mixing and 



unmixing is still clear. "Particles" of PV-substance or PV- 
charge (of either sign) can be thought of as being trans- 
ported along isentropic surfaces, but never across them, 
even when diabatic heating is significant. Hence the up- 
gradient transport involved in unmixing means that PV- 
substance is transported against its isentropic gradient. 
Furthermore, even though the first-moment formula (|3.10j) 
fails, the total absolute angular momentum is still well de- 
fined, and exactly defined. 

An exact counterpart to the basic theorem (|5.7j) would 
therefore make sense as a conjecture. However, we have so 
far failed to prove any such exact theorem. So the ques- 
tion remains open for now. The main technical obstacle 
appears to be the nonlinearity of the exact cyclostrophic 
PV inversion operator. 

13. Concluding remarks 



The basic theorem (|5.7|) proved here underlines the point 
that, especially in problems of jet formation and mainte- 
nance, as well as in "beta-turbulence" problems in general, 
it is advisable to consider the angular-momentum budget 
as well as the enstrophy and energy budgets. The theorem 
underlines another fundamental point as well, namely that 
thought-experiments in which one imagines "stirring" the 
fluid to mix the PV are not well defined until one speci- 
fies what is doing the stirring. Artificial body forces will 
in general cause some unmixing of PV. So too will im- 
mersed bodies such as Welander's massless goldfish (P. B. 
Rhines 1971, personal communication), which produce vor- 
tex quadrupoles and are therefore capable of extending the 
range of PV values. Indeed massless goldfish, by defini- 
tion, cannot change the absolute angular momentum. The 
goldfish might therefore produce profiles like those studied 
in DS08 and illustrated in Fig.QJ: above. 

Another motivation for this work was to advance our 
understanding of Jupiter's weather layer. An adequate rep- 
resentation of what we observe on the real planet will un- 
doubtedly require a coupled model of the weather layer and 
the underlying convection zone. The convection zone is, in 
turn, bounded below by a strongly stratified transition to 
metallic hydrogen, as the pressure increases and the pro- 
portion of ionized hydrogen atoms to neutral atoms and 
H2 molecules builds up with temperature. It is likely that 
Richardson numbers in the transition zone are enormous. 
So it may well be that one can treat the transition zone as a 
rigid but perfectly slippery boundary, whose only function 
is to supply heat from below. 

Our current aim is less ambitious, namely to isolate 
one aspect of the coupling between the top of the convec- 
tion zone and the overlying weather layer, by making the 
simplifying assumption that the main effect of the convec- 
tion zone is to exert the fluctuating form stress required 
to catalyze PV mixing and jet formation. For instance 
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the form stress can be exerted via an artificial "heaving 
topography" b(x, y, t) acting as the forcing function on a 
shallow-water layer, in place of the usual artificial body 
forces. Arguably, the addition of such quasi-topographic 
fo rcing migh t impr ov e the reali sm of simulations like that 



owma 3 (|2007h . IShowmanl also avoids using artificial 



body forces, but assumes that the sole effect of the convec- 
tion zone is to produce small-scale mass injections into the 
weather layer, like thunderstorm anvils. 

A further question is whether, with a more natural and 
realistic forcing, we can reach a statistically steady state 
without having to invoke large-scale Rayleigh friction or 
hypodiffusion, both of which are hardly natural assump- 
tions for a planet with no nearby solid surface. 

These questions are as yet unanswered but we hope to 
make progress on them soon, through numerical experi- 
ments based on sophisticated numerical codes that as far 
as possible respect the angular momentum principle. 
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APPENDIX A 
An alternative proof 

The connection to potential energy noted in section [6] 
suggests an alternative proof of the basic theorem, via a 
mathematical route quite different from that of section [5j 
It is motivated by positive-definite exact formulae for po- 
tential-energy changes that are already known for purely 
advective rearrangem e nts, of buoyanc y (e.f 
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These exact formulae are now recognized as cases of the 
energy-Casimir and momentum-Casimir formulae arising 
in Hamiltonian models of d isturbances to nontrivial initia l 
or background states (e.g., Arnol'dl 1965 : Shepherd] 1993 ). 
The resulting proof of (|5.7j) can be seen as a nontrivial gen- 
eralization of the Hamiltonian theory, made possible by the 
i?-function formalism. 

For a purely advective rearrangement, the Hamiltonian 
formulae apply. In the shallow-water case, for instance, we 
have 

(Al) 



AM = - Po H dyA(y,v) 



where f] is the latitudinal displacement of a fluid element 
expressed as a function of its final latitude y rather than 



its initial latitude y', so that rj(y) = ij{y') = y — y' ', and 
where the function A is defined by 



My, v) 



dfj 



t 9q t {y 



dy 



— V 1 



(A2) 



It is only because of the invertible mapping between the 
initial latitude y and final latitude y of a given fluid cle- 
ment, in the purely advective case, that we can write the 
displacement of that element as a function either of y or 
of y. 

We now show that the i?-function formalism allows us 
to redcrive (|A1[) together with its generalization beyond 
the Hamiltonian framework, as a single expression 




First, we see by inspection that in the purely advective 
case, for which R(y' , y) = 6 (y — y — fj(y)), the expression 
(|A3|) does reproduce (|A1|) . Second, to see that (|A3[) is 
correct for a general R function, we rewrite (|A2[) by sub- 
stituting y* := y — if and integrating by parts to obtain 

My, y-y') = -qi{y'){y - y') - /V Qi(y f ) • (A4) 

Jy 



Now the last term of (|A4[) contributes nothing to (|A3[) . 
This is because it has the functional form a(y) — a(y'). In 
virtue of the integral constraints (|2.7[) and (|2.8[) . the con- 



tribution to (|A3|) is j^ L J L L dy dy' R(y' ,y) [a(y) — a(y')] = 

f_^ L a(y) dy — f_^ L a(y') dy' = 0, for any function a(-) . 

The definition (|5.2p of the average displacement rj(y') 
can be rewritten using (|2.7[) and (|3.17j) as 



V (y') = J dyAR(y',y)(y-y') 

dyR(y',y)(y-y r ) . (A5) 

-L 



Hence by substituting the first term of (|A4I) into (|A3[) . then 
using (|A5j) to rewrite the result in terms of r\{%) ), we see 
that (|A3j) is equivalent to the original expression (|5.1j) for 
AM. The basic theorem (|5.7|) now follows, because (|A2j) 
shows that the function A is positive definite whenever 
dqi/dy is positive, and negative definite whenever dqijdy 
is negative. 

APPENDIX B 
Monotonizing Q 

To see how to obtain monotonically increasing q2 [y ; 3i(0] 
from the general qi(x' , y') by advective rearrangement, one 
may proceed as follows. 
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The function describing the monotonic PV distribution 
1 = 1i [y; <7i(-)] w ill have an inverse function y = y 2 [q- qi(-)]. 
For a given q value, all the fluid with q\ > q will, after 
rearrangement, lie between y = yi [<?; qi (■)] and the northern 
boundary y = L. Hence we may define 



«a[«;3i(0] :=L- J dyH{q 1 (x,y)~q} , (Bl) 

where % is the Heaviside step function and the ovcrbar 
again denotes averaging in x. The redistribution function 
representing the advective rearrangement from qi(x',y') to 

s[x',y',x,y;qi(-)] = J^j s {v - 2/2 [?i (a:', y')\ qi(-)} }, (B2) 

where S is the Dirac delta function. Equation (|B2[) can be 
verified by by substituting this s into (|9.1[) . 
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